Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 10302 |
| Missing cells | 3004 |
| Missing cells (%) | 1.2% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 2.0 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 13 |
| Boolean | 3 |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
INCOME has a high cardinality: 8151 distinct values | High cardinality |
HOME_VAL has a high cardinality: 6334 distinct values | High cardinality |
BLUEBOOK has a high cardinality: 2985 distinct values | High cardinality |
OLDCLAIM has a high cardinality: 3545 distinct values | High cardinality |
CLM_AMT has a high cardinality: 2346 distinct values | High cardinality |
AGE is highly overall correlated with HOMEKIDS | High correlation |
HOMEKIDS is highly overall correlated with AGE and 1 other fields | High correlation |
PARENT1 is highly overall correlated with HOMEKIDS | High correlation |
GENDER is highly overall correlated with CAR_TYPE and 1 other fields | High correlation |
EDUCATION is highly overall correlated with OCCUPATION | High correlation |
OCCUPATION is highly overall correlated with EDUCATION and 1 other fields | High correlation |
CAR_USE is highly overall correlated with OCCUPATION and 1 other fields | High correlation |
CAR_TYPE is highly overall correlated with GENDER and 1 other fields | High correlation |
RED_CAR is highly overall correlated with GENDER | High correlation |
KIDSDRIV is highly imbalanced (71.1%) | Imbalance |
OLDCLAIM is highly imbalanced (53.1%) | Imbalance |
CLM_AMT is highly imbalanced (66.1%) | Imbalance |
YOJ has 548 (5.3%) missing values | Missing |
INCOME has 570 (5.5%) missing values | Missing |
HOME_VAL has 575 (5.6%) missing values | Missing |
OCCUPATION has 665 (6.5%) missing values | Missing |
CAR_AGE has 639 (6.2%) missing values | Missing |
HOMEKIDS has 6694 (65.0%) zeros | Zeros |
YOJ has 807 (7.8%) zeros | Zeros |
CLM_FREQ has 6292 (61.1%) zeros | Zeros |
MVR_PTS has 4658 (45.2%) zeros | Zeros |
Reproduction
| Analysis started | 2023-04-29 09:07:42.508155 |
|---|---|
| Analysis finished | 2023-04-29 09:08:00.515393 |
| Duration | 18.01 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
ID
Real number (ℝ)
| Distinct | 8753 |
|---|---|
| Distinct (%) | 85.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9566311 × 108 |
| Minimum | 63175 |
|---|---|
| Maximum | 9.9992637 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 63175 |
|---|---|
| 5-th percentile | 50696156 |
| Q1 | 2.4428686 × 108 |
| median | 4.9700429 × 108 |
| Q3 | 7.3945507 × 108 |
| 95-th percentile | 9.4436522 × 108 |
| Maximum | 9.9992637 × 108 |
| Range | 9.9986319 × 108 |
| Interquartile range (IQR) | 4.9516821 × 108 |
Descriptive statistics
| Standard deviation | 2.8646748 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.57794795 |
| Kurtosis | -1.1927241 |
| Mean | 4.9566311 × 108 |
| Median Absolute Deviation (MAD) | 2.4643025 × 108 |
| Skewness | 0.0050514277 |
| Sum | 5.1063213 × 1012 |
| Variance | 8.2063617 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 341162899 | 5 | < 0.1% |
| 747557690 | 5 | < 0.1% |
| 173124759 | 5 | < 0.1% |
| 632067262 | 5 | < 0.1% |
| 750731752 | 4 | < 0.1% |
| 303183248 | 4 | < 0.1% |
| 132609655 | 4 | < 0.1% |
| 59026158 | 4 | < 0.1% |
| 983800811 | 4 | < 0.1% |
| 22340563 | 4 | < 0.1% |
| Other values (8743) | 10258 |
| Value | Count | Frequency (%) |
| 63175 | 1 | |
| 246910 | 1 | |
| 401276 | 1 | |
| 813128 | 2 | |
| 1307371 | 2 | |
| 1514697 | 1 | |
| 1541149 | 1 | |
| 1627973 | 1 | |
| 1780186 | 1 | |
| 1860885 | 1 |
| Value | Count | Frequency (%) |
| 999926368 | 2 | |
| 999800537 | 1 | |
| 999640290 | 1 | |
| 999577084 | 1 | |
| 999482663 | 1 | |
| 999457398 | 1 | |
| 999331839 | 1 | |
| 999178959 | 1 | |
| 999169190 | 1 | |
| 999158340 | 1 |
KIDSDRIV
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| 0 | |
|---|---|
| 1 | 804 |
| 2 | 351 |
| 3 | 74 |
| 4 | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10302 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 9069 | |
| 1 | 804 | 7.8% |
| 2 | 351 | 3.4% |
| 3 | 74 | 0.7% |
| 4 | 4 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 9069 | |
| 1 | 804 | 7.8% |
| 2 | 351 | 3.4% |
| 3 | 74 | 0.7% |
| 4 | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9069 | |
| 1 | 804 | 7.8% |
| 2 | 351 | 3.4% |
| 3 | 74 | 0.7% |
| 4 | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10302 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9069 | |
| 1 | 804 | 7.8% |
| 2 | 351 | 3.4% |
| 3 | 74 | 0.7% |
| 4 | 4 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10302 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9069 | |
| 1 | 804 | 7.8% |
| 2 | 351 | 3.4% |
| 3 | 74 | 0.7% |
| 4 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10302 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9069 | |
| 1 | 804 | 7.8% |
| 2 | 351 | 3.4% |
| 3 | 74 | 0.7% |
| 4 | 4 | < 0.1% |
AGE
Real number (ℝ)
| Distinct | 61 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 7 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.837397 |
| Minimum | 16 |
|---|---|
| Maximum | 81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 30 |
| Q1 | 39 |
| median | 45 |
| Q3 | 51 |
| 95-th percentile | 59 |
| Maximum | 81 |
| Range | 65 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 8.606445 |
|---|---|
| Coefficient of variation (CV) | 0.19194792 |
| Kurtosis | -0.080902596 |
| Mean | 44.837397 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.034540655 |
| Sum | 461601 |
| Variance | 74.070896 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 46 | 496 | 4.8% |
| 45 | 488 | 4.7% |
| 48 | 464 | 4.5% |
| 47 | 451 | 4.4% |
| 43 | 441 | 4.3% |
| 41 | 429 | 4.2% |
| 50 | 424 | 4.1% |
| 44 | 423 | 4.1% |
| 40 | 406 | 3.9% |
| 42 | 404 | 3.9% |
| Other values (51) | 5869 |
| Value | Count | Frequency (%) |
| 16 | 5 | < 0.1% |
| 17 | 2 | < 0.1% |
| 18 | 3 | < 0.1% |
| 19 | 8 | 0.1% |
| 20 | 4 | < 0.1% |
| 21 | 12 | 0.1% |
| 22 | 17 | |
| 23 | 12 | 0.1% |
| 24 | 25 | |
| 25 | 32 |
| Value | Count | Frequency (%) |
| 81 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
| 73 | 4 | < 0.1% |
| 72 | 4 | < 0.1% |
| 71 | 1 | < 0.1% |
| 70 | 6 | 0.1% |
| 69 | 5 | < 0.1% |
| 68 | 8 | |
| 67 | 16 |
HOMEKIDS
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.72044263 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 6694 |
| Zeros (%) | 65.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1163232 |
|---|---|
| Coefficient of variation (CV) | 1.5494963 |
| Kurtosis | 0.6293464 |
| Mean | 0.72044263 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.3366776 |
| Sum | 7422 |
| Variance | 1.2461775 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6694 | |
| 2 | 1427 | 13.9% |
| 1 | 1106 | 10.7% |
| 3 | 856 | 8.3% |
| 4 | 201 | 2.0% |
| 5 | 18 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 6694 | |
| 1 | 1106 | 10.7% |
| 2 | 1427 | 13.9% |
| 3 | 856 | 8.3% |
| 4 | 201 | 2.0% |
| 5 | 18 | 0.2% |
| Value | Count | Frequency (%) |
| 5 | 18 | 0.2% |
| 4 | 201 | 2.0% |
| 3 | 856 | 8.3% |
| 2 | 1427 | 13.9% |
| 1 | 1106 | 10.7% |
| 0 | 6694 |
YOJ
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 548 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.474062 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 807 |
| Zeros (%) | 7.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 9 |
| median | 11 |
| Q3 | 13 |
| 95-th percentile | 15 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 4.1089432 |
|---|---|
| Coefficient of variation (CV) | 0.39229701 |
| Kurtosis | 1.1448021 |
| Mean | 10.474062 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.2008229 |
| Sum | 102164 |
| Variance | 16.883414 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 1500 | |
| 11 | 1267 | |
| 13 | 1266 | |
| 14 | 996 | |
| 10 | 934 | |
| 0 | 807 | |
| 9 | 653 | |
| 15 | 583 | 5.7% |
| 8 | 484 | 4.7% |
| 7 | 384 | 3.7% |
| Other values (11) | 880 | |
| (Missing) | 548 | 5.3% |
| Value | Count | Frequency (%) |
| 0 | 807 | |
| 1 | 7 | 0.1% |
| 2 | 21 | 0.2% |
| 3 | 38 | 0.4% |
| 4 | 49 | 0.5% |
| 5 | 124 | 1.2% |
| 6 | 219 | 2.1% |
| 7 | 384 | |
| 8 | 484 | |
| 9 | 653 |
| Value | Count | Frequency (%) |
| 23 | 2 | < 0.1% |
| 19 | 17 | 0.2% |
| 18 | 33 | 0.3% |
| 17 | 127 | 1.2% |
| 16 | 243 | 2.4% |
| 15 | 583 | 5.7% |
| 14 | 996 | |
| 13 | 1266 | |
| 12 | 1500 | |
| 11 | 1267 |
INCOME
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 8151 |
|---|---|
| Distinct (%) | 83.8% |
| Missing | 570 |
| Missing (%) | 5.5% |
| Memory size | 80.6 KiB |
| $0 | 797 |
|---|---|
| $61,790 | 5 |
| $64,916 | 4 |
| $48,509 | 4 |
| $30,111 | 4 |
| Other values (8146) |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.714961 |
| Min length | 2 |
Characters and Unicode
| Total characters | 65350 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7443 ? |
|---|---|
| Unique (%) | 76.5% |
Sample
| 1st row | $67,349 |
|---|---|
| 2nd row | $91,449 |
| 3rd row | $52,881 |
| 4th row | $16,039 |
| 5th row | $114,986 |
Common Values
| Value | Count | Frequency (%) |
| $0 | 797 | 7.7% |
| $61,790 | 5 | < 0.1% |
| $64,916 | 4 | < 0.1% |
| $48,509 | 4 | < 0.1% |
| $30,111 | 4 | < 0.1% |
| $43,393 | 4 | < 0.1% |
| $26,840 | 4 | < 0.1% |
| $38,290 | 3 | < 0.1% |
| $2,346 | 3 | < 0.1% |
| $82,398 | 3 | < 0.1% |
| Other values (8141) | 8901 | |
| (Missing) | 570 | 5.5% |
Length
| Value | Count | Frequency (%) |
| 0 | 797 | 8.2% |
| 61,790 | 5 | 0.1% |
| 64,916 | 4 | < 0.1% |
| 48,509 | 4 | < 0.1% |
| 30,111 | 4 | < 0.1% |
| 43,393 | 4 | < 0.1% |
| 26,840 | 4 | < 0.1% |
| 107,375 | 3 | < 0.1% |
| 47,513 | 3 | < 0.1% |
| 19,599 | 3 | < 0.1% |
| Other values (8141) | 8901 |
Most occurring characters
| Value | Count | Frequency (%) |
| $ | 9732 | |
| , | 8884 | |
| 1 | 6016 | |
| 2 | 4914 | |
| 3 | 4851 | |
| 0 | 4799 | |
| 4 | 4642 | |
| 5 | 4597 | |
| 6 | 4531 | |
| 7 | 4252 | |
| Other values (2) | 8132 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46734 | |
| Currency Symbol | 9732 | 14.9% |
| Other Punctuation | 8884 | 13.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6016 | |
| 2 | 4914 | |
| 3 | 4851 | |
| 0 | 4799 | |
| 4 | 4642 | |
| 5 | 4597 | |
| 6 | 4531 | |
| 7 | 4252 | |
| 9 | 4073 | |
| 8 | 4059 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 9732 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8884 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 65350 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| $ | 9732 | |
| , | 8884 | |
| 1 | 6016 | |
| 2 | 4914 | |
| 3 | 4851 | |
| 0 | 4799 | |
| 4 | 4642 | |
| 5 | 4597 | |
| 6 | 4531 | |
| 7 | 4252 | |
| Other values (2) | 8132 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 65350 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| $ | 9732 | |
| , | 8884 | |
| 1 | 6016 | |
| 2 | 4914 | |
| 3 | 4851 | |
| 0 | 4799 | |
| 4 | 4642 | |
| 5 | 4597 | |
| 6 | 4531 | |
| 7 | 4252 | |
| Other values (2) | 8132 |
PARENT1
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 8959 | |
| True | 1343 | 13.0% |
HOME_VAL
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 6334 |
|---|---|
| Distinct (%) | 65.1% |
| Missing | 575 |
| Missing (%) | 5.6% |
| Memory size | 80.6 KiB |
| $0 | |
|---|---|
| $151,286 | 3 |
| $214,584 | 3 |
| $159,568 | 3 |
| $167,505 | 3 |
| Other values (6329) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 6.1581166 |
| Min length | 2 |
Characters and Unicode
| Total characters | 59900 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5883 ? |
|---|---|
| Unique (%) | 60.5% |
Sample
| 1st row | $0 |
|---|---|
| 2nd row | $257,252 |
| 3rd row | $0 |
| 4th row | $124,191 |
| 5th row | $306,251 |
Common Values
| Value | Count | Frequency (%) |
| $0 | 2908 | |
| $151,286 | 3 | < 0.1% |
| $214,584 | 3 | < 0.1% |
| $159,568 | 3 | < 0.1% |
| $167,505 | 3 | < 0.1% |
| $99,103 | 3 | < 0.1% |
| $332,673 | 3 | < 0.1% |
| $166,481 | 3 | < 0.1% |
| $178,852 | 3 | < 0.1% |
| $165,641 | 3 | < 0.1% |
| Other values (6324) | 6792 | |
| (Missing) | 575 | 5.6% |
Length
| Value | Count | Frequency (%) |
| 0 | 2908 | |
| 225,111 | 3 | < 0.1% |
| 121,949 | 3 | < 0.1% |
| 153,061 | 3 | < 0.1% |
| 117,038 | 3 | < 0.1% |
| 196,320 | 3 | < 0.1% |
| 176,219 | 3 | < 0.1% |
| 154,672 | 3 | < 0.1% |
| 244,764 | 3 | < 0.1% |
| 288,592 | 3 | < 0.1% |
| Other values (6324) | 6792 |
Most occurring characters
| Value | Count | Frequency (%) |
| $ | 9727 | |
| , | 6819 | |
| 0 | 6361 | |
| 1 | 6141 | |
| 2 | 5872 | |
| 3 | 4193 | |
| 4 | 3591 | 6.0% |
| 5 | 3496 | 5.8% |
| 8 | 3447 | 5.8% |
| 6 | 3432 | 5.7% |
| Other values (2) | 6821 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 43354 | |
| Currency Symbol | 9727 | 16.2% |
| Other Punctuation | 6819 | 11.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6361 | |
| 1 | 6141 | |
| 2 | 5872 | |
| 3 | 4193 | |
| 4 | 3591 | |
| 5 | 3496 | |
| 8 | 3447 | |
| 6 | 3432 | |
| 9 | 3426 | |
| 7 | 3395 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 9727 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 6819 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 59900 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| $ | 9727 | |
| , | 6819 | |
| 0 | 6361 | |
| 1 | 6141 | |
| 2 | 5872 | |
| 3 | 4193 | |
| 4 | 3591 | 6.0% |
| 5 | 3496 | 5.8% |
| 8 | 3447 | 5.8% |
| 6 | 3432 | 5.7% |
| Other values (2) | 6821 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59900 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| $ | 9727 | |
| , | 6819 | |
| 0 | 6361 | |
| 1 | 6141 | |
| 2 | 5872 | |
| 3 | 4193 | |
| 4 | 3591 | 6.0% |
| 5 | 3496 | 5.8% |
| 8 | 3447 | 5.8% |
| 6 | 3432 | 5.7% |
| Other values (2) | 6821 |
MSTATUS
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| Yes | |
|---|---|
| z_No |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.3993399 |
| Min length | 3 |
Characters and Unicode
| Total characters | 35020 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | z_No |
|---|---|
| 2nd row | z_No |
| 3rd row | z_No |
| 4th row | Yes |
| 5th row | Yes |
Common Values
| Value | Count | Frequency (%) |
| Yes | 6188 | |
| z_No | 4114 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| yes | 6188 | |
| z_no | 4114 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 6188 | |
| e | 6188 | |
| s | 6188 | |
| z | 4114 | |
| _ | 4114 | |
| N | 4114 | |
| o | 4114 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20604 | |
| Uppercase Letter | 10302 | |
| Connector Punctuation | 4114 | 11.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6188 | |
| s | 6188 | |
| z | 4114 | |
| o | 4114 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 6188 | |
| N | 4114 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4114 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30906 | |
| Common | 4114 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 6188 | |
| e | 6188 | |
| s | 6188 | |
| z | 4114 | |
| N | 4114 | |
| o | 4114 |
Common
| Value | Count | Frequency (%) |
| _ | 4114 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 6188 | |
| e | 6188 | |
| s | 6188 | |
| z | 4114 | |
| _ | 4114 | |
| N | 4114 | |
| o | 4114 |
GENDER
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| z_F | |
|---|---|
| M |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.07649 |
| Min length | 1 |
Characters and Unicode
| Total characters | 21392 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | z_F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| z_F | 5545 | |
| M | 4757 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| z_f | 5545 | |
| m | 4757 |
Most occurring characters
| Value | Count | Frequency (%) |
| z | 5545 | |
| _ | 5545 | |
| F | 5545 | |
| M | 4757 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10302 | |
| Lowercase Letter | 5545 | |
| Connector Punctuation | 5545 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 5545 | |
| M | 4757 |
Lowercase Letter
| Value | Count | Frequency (%) |
| z | 5545 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5545 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15847 | |
| Common | 5545 | 25.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| z | 5545 | |
| F | 5545 | |
| M | 4757 |
Common
| Value | Count | Frequency (%) |
| _ | 5545 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21392 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| z | 5545 | |
| _ | 5545 | |
| F | 5545 | |
| M | 4757 |
EDUCATION
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| z_High School | |
|---|---|
| Bachelors | |
| Masters | |
| <High School | |
| PhD |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 9.6399728 |
| Min length | 3 |
Characters and Unicode
| Total characters | 99311 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PhD |
|---|---|
| 2nd row | z_High School |
| 3rd row | Bachelors |
| 4th row | z_High School |
| 5th row | <High School |
Common Values
| Value | Count | Frequency (%) |
| z_High School | 2952 | |
| Bachelors | 2823 | |
| Masters | 2078 | |
| <High School | 1515 | |
| PhD | 934 | 9.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| school | 4467 | |
| z_high | 2952 | |
| bachelors | 2823 | |
| masters | 2078 | |
| high | 1515 | 10.3% |
| phd | 934 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 12691 | |
| o | 11757 | 11.8% |
| l | 7290 | 7.3% |
| c | 7290 | 7.3% |
| s | 6979 | 7.0% |
| e | 4901 | 4.9% |
| r | 4901 | 4.9% |
| a | 4901 | 4.9% |
| H | 4467 | 4.5% |
| i | 4467 | 4.5% |
| Other values (11) | 29667 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 74674 | |
| Uppercase Letter | 15703 | 15.8% |
| Space Separator | 4467 | 4.5% |
| Connector Punctuation | 2952 | 3.0% |
| Math Symbol | 1515 | 1.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 12691 | |
| o | 11757 | |
| l | 7290 | |
| c | 7290 | |
| s | 6979 | |
| e | 4901 | 6.6% |
| r | 4901 | 6.6% |
| a | 4901 | 6.6% |
| i | 4467 | 6.0% |
| g | 4467 | 6.0% |
| Other values (2) | 5030 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 4467 | |
| S | 4467 | |
| B | 2823 | |
| M | 2078 | |
| P | 934 | 5.9% |
| D | 934 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| 4467 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2952 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 1515 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 90377 | |
| Common | 8934 | 9.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 12691 | |
| o | 11757 | |
| l | 7290 | 8.1% |
| c | 7290 | 8.1% |
| s | 6979 | 7.7% |
| e | 4901 | 5.4% |
| r | 4901 | 5.4% |
| a | 4901 | 5.4% |
| H | 4467 | 4.9% |
| i | 4467 | 4.9% |
| Other values (8) | 20733 |
Common
| Value | Count | Frequency (%) |
| 4467 | ||
| _ | 2952 | |
| < | 1515 | 17.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99311 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| h | 12691 | |
| o | 11757 | 11.8% |
| l | 7290 | 7.3% |
| c | 7290 | 7.3% |
| s | 6979 | 7.0% |
| e | 4901 | 4.9% |
| r | 4901 | 4.9% |
| a | 4901 | 4.9% |
| H | 4467 | 4.5% |
| i | 4467 | 4.5% |
| Other values (11) | 29667 |
OCCUPATION
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 665 |
| Missing (%) | 6.5% |
| Memory size | 80.6 KiB |
| z_Blue Collar | |
|---|---|
| Clerical | |
| Professional | |
| Manager | |
| Lawyer | |
| Other values (3) |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 9.44215 |
| Min length | 6 |
Characters and Unicode
| Total characters | 90994 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Professional |
|---|---|
| 2nd row | z_Blue Collar |
| 3rd row | Manager |
| 4th row | Clerical |
| 5th row | z_Blue Collar |
Common Values
| Value | Count | Frequency (%) |
| z_Blue Collar | 2288 | |
| Clerical | 1590 | |
| Professional | 1408 | |
| Manager | 1257 | |
| Lawyer | 1031 | |
| Student | 899 | 8.7% |
| Home Maker | 843 | 8.2% |
| Doctor | 321 | 3.1% |
| (Missing) | 665 | 6.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| z_blue | 2288 | |
| collar | 2288 | |
| clerical | 1590 | |
| professional | 1408 | |
| manager | 1257 | |
| lawyer | 1031 | |
| student | 899 | 7.0% |
| home | 843 | 6.6% |
| maker | 843 | 6.6% |
| doctor | 321 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 11452 | |
| e | 10159 | 11.2% |
| a | 9674 | 10.6% |
| r | 8738 | 9.6% |
| o | 6589 | 7.2% |
| C | 3878 | 4.3% |
| n | 3564 | 3.9% |
| u | 3187 | 3.5% |
| 3131 | 3.4% | |
| i | 2998 | 3.3% |
| Other values (19) | 27624 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 72807 | |
| Uppercase Letter | 12768 | 14.0% |
| Space Separator | 3131 | 3.4% |
| Connector Punctuation | 2288 | 2.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 11452 | |
| e | 10159 | |
| a | 9674 | |
| r | 8738 | |
| o | 6589 | |
| n | 3564 | 4.9% |
| u | 3187 | 4.4% |
| i | 2998 | 4.1% |
| s | 2816 | 3.9% |
| z | 2288 | 3.1% |
| Other values (9) | 11342 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3878 | |
| B | 2288 | |
| M | 2100 | |
| P | 1408 | 11.0% |
| L | 1031 | 8.1% |
| S | 899 | 7.0% |
| H | 843 | 6.6% |
| D | 321 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 3131 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2288 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85575 | |
| Common | 5419 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 11452 | |
| e | 10159 | |
| a | 9674 | |
| r | 8738 | 10.2% |
| o | 6589 | 7.7% |
| C | 3878 | 4.5% |
| n | 3564 | 4.2% |
| u | 3187 | 3.7% |
| i | 2998 | 3.5% |
| s | 2816 | 3.3% |
| Other values (17) | 22520 |
Common
| Value | Count | Frequency (%) |
| 3131 | ||
| _ | 2288 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 90994 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 11452 | |
| e | 10159 | 11.2% |
| a | 9674 | 10.6% |
| r | 8738 | 9.6% |
| o | 6589 | 7.2% |
| C | 3878 | 4.3% |
| n | 3564 | 3.9% |
| u | 3187 | 3.5% |
| 3131 | 3.4% | |
| i | 2998 | 3.3% |
| Other values (19) | 27624 |
TRAVTIME
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.416424 |
| Minimum | 5 |
|---|---|
| Maximum | 142 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 22 |
| median | 33 |
| Q3 | 44 |
| 95-th percentile | 60 |
| Maximum | 142 |
| Range | 137 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 15.869687 |
|---|---|
| Coefficient of variation (CV) | 0.4749068 |
| Kurtosis | 0.59465625 |
| Mean | 33.416424 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.43552716 |
| Sum | 344256 |
| Variance | 251.84696 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 427 | 4.1% |
| 32 | 288 | 2.8% |
| 35 | 271 | 2.6% |
| 33 | 268 | 2.6% |
| 36 | 266 | 2.6% |
| 30 | 265 | 2.6% |
| 37 | 264 | 2.6% |
| 29 | 259 | 2.5% |
| 25 | 257 | 2.5% |
| 24 | 253 | 2.5% |
| Other values (90) | 7484 |
| Value | Count | Frequency (%) |
| 5 | 427 | |
| 6 | 66 | 0.6% |
| 7 | 56 | 0.5% |
| 8 | 69 | 0.7% |
| 9 | 86 | 0.8% |
| 10 | 101 | 1.0% |
| 11 | 90 | 0.9% |
| 12 | 122 | 1.2% |
| 13 | 121 | 1.2% |
| 14 | 133 | 1.3% |
| Value | Count | Frequency (%) |
| 142 | 1 | |
| 134 | 1 | |
| 124 | 1 | |
| 113 | 1 | |
| 105 | 1 | |
| 103 | 1 | |
| 101 | 1 | |
| 99 | 1 | |
| 98 | 1 | |
| 97 | 2 |
CAR_USE
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| Private | |
|---|---|
| Commercial |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 8.103378 |
| Min length | 7 |
Characters and Unicode
| Total characters | 83481 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private |
|---|---|
| 2nd row | Commercial |
| 3rd row | Private |
| 4th row | Private |
| 5th row | Private |
Common Values
| Value | Count | Frequency (%) |
| Private | 6513 | |
| Commercial | 3789 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| private | 6513 | |
| commercial | 3789 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 10302 | |
| i | 10302 | |
| a | 10302 | |
| e | 10302 | |
| m | 7578 | |
| P | 6513 | |
| v | 6513 | |
| t | 6513 | |
| C | 3789 | 4.5% |
| o | 3789 | 4.5% |
| Other values (2) | 7578 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 73179 | |
| Uppercase Letter | 10302 | 12.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 10302 | |
| i | 10302 | |
| a | 10302 | |
| e | 10302 | |
| m | 7578 | |
| v | 6513 | |
| t | 6513 | |
| o | 3789 | 5.2% |
| c | 3789 | 5.2% |
| l | 3789 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 6513 | |
| C | 3789 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83481 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 10302 | |
| i | 10302 | |
| a | 10302 | |
| e | 10302 | |
| m | 7578 | |
| P | 6513 | |
| v | 6513 | |
| t | 6513 | |
| C | 3789 | 4.5% |
| o | 3789 | 4.5% |
| Other values (2) | 7578 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 83481 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 10302 | |
| i | 10302 | |
| a | 10302 | |
| e | 10302 | |
| m | 7578 | |
| P | 6513 | |
| v | 6513 | |
| t | 6513 | |
| C | 3789 | 4.5% |
| o | 3789 | 4.5% |
| Other values (2) | 7578 |
BLUEBOOK
Categorical
| Distinct | 2985 |
|---|---|
| Distinct (%) | 29.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| $1,500 | 207 |
|---|---|
| $6,200 | 47 |
| $6,000 | 42 |
| $5,800 | 39 |
| $5,400 | 38 |
| Other values (2980) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.7128713 |
| Min length | 6 |
Characters and Unicode
| Total characters | 69156 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 834 ? |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | $14,230 |
|---|---|
| 2nd row | $14,940 |
| 3rd row | $21,970 |
| 4th row | $4,010 |
| 5th row | $15,440 |
Common Values
| Value | Count | Frequency (%) |
| $1,500 | 207 | 2.0% |
| $6,200 | 47 | 0.5% |
| $6,000 | 42 | 0.4% |
| $5,800 | 39 | 0.4% |
| $5,400 | 38 | 0.4% |
| $5,600 | 38 | 0.4% |
| $5,900 | 37 | 0.4% |
| $5,700 | 36 | 0.3% |
| $6,500 | 36 | 0.3% |
| $6,400 | 35 | 0.3% |
| Other values (2975) | 9747 |
Length
| Value | Count | Frequency (%) |
| 1,500 | 207 | 2.0% |
| 6,200 | 47 | 0.5% |
| 6,000 | 42 | 0.4% |
| 5,800 | 39 | 0.4% |
| 5,400 | 38 | 0.4% |
| 5,600 | 38 | 0.4% |
| 5,900 | 37 | 0.4% |
| 5,700 | 36 | 0.3% |
| 6,500 | 36 | 0.3% |
| 6,100 | 35 | 0.3% |
| Other values (2975) | 9747 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14152 | |
| $ | 10302 | |
| , | 10302 | |
| 1 | 7572 | |
| 2 | 5057 | 7.3% |
| 3 | 3419 | 4.9% |
| 5 | 3323 | 4.8% |
| 6 | 3148 | 4.6% |
| 4 | 3059 | 4.4% |
| 7 | 3047 | 4.4% |
| Other values (2) | 5775 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 48552 | |
| Currency Symbol | 10302 | 14.9% |
| Other Punctuation | 10302 | 14.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 14152 | |
| 1 | 7572 | |
| 2 | 5057 | 10.4% |
| 3 | 3419 | 7.0% |
| 5 | 3323 | 6.8% |
| 6 | 3148 | 6.5% |
| 4 | 3059 | 6.3% |
| 7 | 3047 | 6.3% |
| 8 | 2932 | 6.0% |
| 9 | 2843 | 5.9% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 10302 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10302 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 69156 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 14152 | |
| $ | 10302 | |
| , | 10302 | |
| 1 | 7572 | |
| 2 | 5057 | 7.3% |
| 3 | 3419 | 4.9% |
| 5 | 3323 | 4.8% |
| 6 | 3148 | 4.6% |
| 4 | 3059 | 4.4% |
| 7 | 3047 | 4.4% |
| Other values (2) | 5775 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 14152 | |
| $ | 10302 | |
| , | 10302 | |
| 1 | 7572 | |
| 2 | 5057 | 7.3% |
| 3 | 3419 | 4.9% |
| 5 | 3323 | 4.8% |
| 6 | 3148 | 4.6% |
| 4 | 3059 | 4.4% |
| 7 | 3047 | 4.4% |
| Other values (2) | 5775 |
TIF
Real number (ℝ)
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.3291594 |
| Minimum | 1 |
|---|---|
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 13 |
| Maximum | 25 |
| Range | 24 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.1107947 |
|---|---|
| Coefficient of variation (CV) | 0.7713777 |
| Kurtosis | 0.47970857 |
| Mean | 5.3291594 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.89941526 |
| Sum | 54901 |
| Variance | 16.898633 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3172 | |
| 6 | 1707 | |
| 4 | 1616 | |
| 10 | 951 | 9.2% |
| 7 | 781 | 7.6% |
| 3 | 531 | 5.2% |
| 13 | 355 | 3.4% |
| 11 | 300 | 2.9% |
| 9 | 299 | 2.9% |
| 17 | 126 | 1.2% |
| Other values (13) | 464 | 4.5% |
| Value | Count | Frequency (%) |
| 1 | 3172 | |
| 2 | 6 | 0.1% |
| 3 | 531 | 5.2% |
| 4 | 1616 | |
| 5 | 70 | 0.7% |
| 6 | 1707 | |
| 7 | 781 | 7.6% |
| 8 | 83 | 0.8% |
| 9 | 299 | 2.9% |
| 10 | 951 | 9.2% |
| Value | Count | Frequency (%) |
| 25 | 3 | < 0.1% |
| 22 | 3 | < 0.1% |
| 21 | 13 | 0.1% |
| 20 | 12 | 0.1% |
| 19 | 11 | 0.1% |
| 18 | 26 | 0.3% |
| 17 | 126 | |
| 16 | 50 | 0.5% |
| 15 | 40 | 0.4% |
| 14 | 92 |
CAR_TYPE
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| z_SUV | |
|---|---|
| Minivan | |
| Pickup | |
| Sports Car | |
| Van |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.5852262 |
| Min length | 3 |
Characters and Unicode
| Total characters | 67841 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Minivan |
|---|---|
| 2nd row | Minivan |
| 3rd row | Van |
| 4th row | z_SUV |
| 5th row | Minivan |
Common Values
| Value | Count | Frequency (%) |
| z_SUV | 2883 | |
| Minivan | 2694 | |
| Pickup | 1772 | |
| Sports Car | 1179 | |
| Van | 921 | 8.9% |
| Panel Truck | 853 | 8.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| z_suv | 2883 | |
| minivan | 2694 | |
| pickup | 1772 | |
| sports | 1179 | |
| car | 1179 | |
| van | 921 | 7.5% |
| panel | 853 | 6.9% |
| truck | 853 | 6.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 7162 | 10.6% |
| i | 7160 | 10.6% |
| a | 5647 | 8.3% |
| S | 4062 | 6.0% |
| V | 3804 | 5.6% |
| r | 3211 | 4.7% |
| p | 2951 | 4.3% |
| z | 2883 | 4.2% |
| U | 2883 | 4.2% |
| _ | 2883 | 4.2% |
| Other values (14) | 25195 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44826 | |
| Uppercase Letter | 18100 | |
| Connector Punctuation | 2883 | 4.2% |
| Space Separator | 2032 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 7162 | |
| i | 7160 | |
| a | 5647 | |
| r | 3211 | |
| p | 2951 | |
| z | 2883 | |
| v | 2694 | 6.0% |
| u | 2625 | 5.9% |
| k | 2625 | 5.9% |
| c | 2625 | 5.9% |
| Other values (5) | 5243 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 4062 | |
| V | 3804 | |
| U | 2883 | |
| M | 2694 | |
| P | 2625 | |
| C | 1179 | 6.5% |
| T | 853 | 4.7% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2883 |
Space Separator
| Value | Count | Frequency (%) |
| 2032 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62926 | |
| Common | 4915 | 7.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 7162 | 11.4% |
| i | 7160 | 11.4% |
| a | 5647 | 9.0% |
| S | 4062 | 6.5% |
| V | 3804 | 6.0% |
| r | 3211 | 5.1% |
| p | 2951 | 4.7% |
| z | 2883 | 4.6% |
| U | 2883 | 4.6% |
| M | 2694 | 4.3% |
| Other values (12) | 20469 |
Common
| Value | Count | Frequency (%) |
| _ | 2883 | |
| 2032 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67841 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 7162 | 10.6% |
| i | 7160 | 10.6% |
| a | 5647 | 8.3% |
| S | 4062 | 6.0% |
| V | 3804 | 5.6% |
| r | 3211 | 4.7% |
| p | 2951 | 4.3% |
| z | 2883 | 4.2% |
| U | 2883 | 4.2% |
| _ | 2883 | 4.2% |
| Other values (14) | 25195 |
RED_CAR
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 7326 | |
| True | 2976 |
OLDCLAIM
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 3545 |
|---|---|
| Distinct (%) | 34.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| $0 | |
|---|---|
| $1,310 | 4 |
| $4,448 | 4 |
| $1,391 | 4 |
| $4,188 | 4 |
| Other values (3540) |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 3.6257037 |
| Min length | 2 |
Characters and Unicode
| Total characters | 37352 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3130 ? |
|---|---|
| Unique (%) | 30.4% |
Sample
| 1st row | $4,461 |
|---|---|
| 2nd row | $0 |
| 3rd row | $0 |
| 4th row | $38,690 |
| 5th row | $0 |
Common Values
| Value | Count | Frequency (%) |
| $0 | 6292 | |
| $1,310 | 4 | < 0.1% |
| $4,448 | 4 | < 0.1% |
| $1,391 | 4 | < 0.1% |
| $4,188 | 4 | < 0.1% |
| $4,538 | 4 | < 0.1% |
| $4,263 | 4 | < 0.1% |
| $1,105 | 4 | < 0.1% |
| $3,960 | 3 | < 0.1% |
| $3,068 | 3 | < 0.1% |
| Other values (3535) | 3976 |
Length
| Value | Count | Frequency (%) |
| 0 | 6292 | |
| 4,448 | 4 | < 0.1% |
| 1,391 | 4 | < 0.1% |
| 4,188 | 4 | < 0.1% |
| 4,538 | 4 | < 0.1% |
| 4,263 | 4 | < 0.1% |
| 1,105 | 4 | < 0.1% |
| 1,310 | 4 | < 0.1% |
| 3,338 | 3 | < 0.1% |
| 6,985 | 3 | < 0.1% |
| Other values (3535) | 3976 |
Most occurring characters
| Value | Count | Frequency (%) |
| $ | 10302 | |
| 0 | 7636 | |
| , | 3882 | 10.4% |
| 3 | 2012 | 5.4% |
| 1 | 1963 | 5.3% |
| 4 | 1815 | 4.9% |
| 5 | 1769 | 4.7% |
| 2 | 1763 | 4.7% |
| 6 | 1626 | 4.4% |
| 8 | 1552 | 4.2% |
| Other values (2) | 3032 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23168 | |
| Currency Symbol | 10302 | |
| Other Punctuation | 3882 | 10.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7636 | |
| 3 | 2012 | 8.7% |
| 1 | 1963 | 8.5% |
| 4 | 1815 | 7.8% |
| 5 | 1769 | 7.6% |
| 2 | 1763 | 7.6% |
| 6 | 1626 | 7.0% |
| 8 | 1552 | 6.7% |
| 7 | 1546 | 6.7% |
| 9 | 1486 | 6.4% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 10302 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3882 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37352 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| $ | 10302 | |
| 0 | 7636 | |
| , | 3882 | 10.4% |
| 3 | 2012 | 5.4% |
| 1 | 1963 | 5.3% |
| 4 | 1815 | 4.9% |
| 5 | 1769 | 4.7% |
| 2 | 1763 | 4.7% |
| 6 | 1626 | 4.4% |
| 8 | 1552 | 4.2% |
| Other values (2) | 3032 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| $ | 10302 | |
| 0 | 7636 | |
| , | 3882 | 10.4% |
| 3 | 2012 | 5.4% |
| 1 | 1963 | 5.3% |
| 4 | 1815 | 4.9% |
| 5 | 1769 | 4.7% |
| 2 | 1763 | 4.7% |
| 6 | 1626 | 4.4% |
| 8 | 1552 | 4.2% |
| Other values (2) | 3032 | 8.1% |
CLM_FREQ
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.80071831 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 6292 |
| Zeros (%) | 61.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.1540786 |
|---|---|
| Coefficient of variation (CV) | 1.4413041 |
| Kurtosis | 0.24591814 |
| Mean | 0.80071831 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1940624 |
| Sum | 8249 |
| Variance | 1.3318974 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 6292 | |
| 2 | 1492 | 14.5% |
| 1 | 1279 | 12.4% |
| 3 | 992 | 9.6% |
| 4 | 225 | 2.2% |
| 5 | 22 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 6292 | |
| 1 | 1279 | 12.4% |
| 2 | 1492 | 14.5% |
| 3 | 992 | 9.6% |
| 4 | 225 | 2.2% |
| 5 | 22 | 0.2% |
| Value | Count | Frequency (%) |
| 5 | 22 | 0.2% |
| 4 | 225 | 2.2% |
| 3 | 992 | 9.6% |
| 2 | 1492 | 14.5% |
| 1 | 1279 | 12.4% |
| 0 | 6292 |
REVOKED
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 9041 | |
| True | 1261 | 12.2% |
MVR_PTS
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7101534 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 4658 |
| Zeros (%) | 45.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.1590149 |
|---|---|
| Coefficient of variation (CV) | 1.2624686 |
| Kurtosis | 1.3358371 |
| Mean | 1.7101534 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3405063 |
| Sum | 17618 |
| Variance | 4.6613453 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4658 | |
| 1 | 1467 | 14.2% |
| 2 | 1199 | 11.6% |
| 3 | 966 | 9.4% |
| 4 | 727 | 7.1% |
| 5 | 528 | 5.1% |
| 6 | 341 | 3.3% |
| 7 | 213 | 2.1% |
| 8 | 114 | 1.1% |
| 9 | 53 | 0.5% |
| Other values (4) | 36 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 4658 | |
| 1 | 1467 | 14.2% |
| 2 | 1199 | 11.6% |
| 3 | 966 | 9.4% |
| 4 | 727 | 7.1% |
| 5 | 528 | 5.1% |
| 6 | 341 | 3.3% |
| 7 | 213 | 2.1% |
| 8 | 114 | 1.1% |
| 9 | 53 | 0.5% |
| Value | Count | Frequency (%) |
| 13 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| 11 | 13 | 0.1% |
| 10 | 20 | 0.2% |
| 9 | 53 | 0.5% |
| 8 | 114 | 1.1% |
| 7 | 213 | 2.1% |
| 6 | 341 | |
| 5 | 528 | |
| 4 | 727 |
CLM_AMT
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 2346 |
|---|---|
| Distinct (%) | 22.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| $0 | |
|---|---|
| $2,327 | 4 |
| $3,674 | 4 |
| $3,350 | 4 |
| $4,363 | 4 |
| Other values (2341) |
Length
| Max length | 8 |
|---|---|
| Median length | 2 |
| Mean length | 3.0606678 |
| Min length | 2 |
Characters and Unicode
| Total characters | 31531 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1996 ? |
|---|---|
| Unique (%) | 19.4% |
Sample
| 1st row | $0 |
|---|---|
| 2nd row | $0 |
| 3rd row | $0 |
| 4th row | $0 |
| 5th row | $0 |
Common Values
| Value | Count | Frequency (%) |
| $0 | 7556 | |
| $2,327 | 4 | < 0.1% |
| $3,674 | 4 | < 0.1% |
| $3,350 | 4 | < 0.1% |
| $4,363 | 4 | < 0.1% |
| $3,667 | 4 | < 0.1% |
| $5,900 | 3 | < 0.1% |
| $4,506 | 3 | < 0.1% |
| $6,409 | 3 | < 0.1% |
| $5,951 | 3 | < 0.1% |
| Other values (2336) | 2714 | 26.3% |
Length
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 3,674 | 4 | < 0.1% |
| 3,350 | 4 | < 0.1% |
| 4,363 | 4 | < 0.1% |
| 3,667 | 4 | < 0.1% |
| 2,327 | 4 | < 0.1% |
| 6,879 | 3 | < 0.1% |
| 2,493 | 3 | < 0.1% |
| 1,479 | 3 | < 0.1% |
| 3,858 | 3 | < 0.1% |
| Other values (2336) | 2714 | 26.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| $ | 10302 | |
| 0 | 8433 | |
| , | 2620 | 8.3% |
| 3 | 1378 | 4.4% |
| 4 | 1297 | 4.1% |
| 2 | 1281 | 4.1% |
| 1 | 1220 | 3.9% |
| 5 | 1205 | 3.8% |
| 6 | 1039 | 3.3% |
| 7 | 946 | 3.0% |
| Other values (2) | 1810 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18609 | |
| Currency Symbol | 10302 | |
| Other Punctuation | 2620 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8433 | |
| 3 | 1378 | 7.4% |
| 4 | 1297 | 7.0% |
| 2 | 1281 | 6.9% |
| 1 | 1220 | 6.6% |
| 5 | 1205 | 6.5% |
| 6 | 1039 | 5.6% |
| 7 | 946 | 5.1% |
| 8 | 923 | 5.0% |
| 9 | 887 | 4.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 10302 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 31531 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| $ | 10302 | |
| 0 | 8433 | |
| , | 2620 | 8.3% |
| 3 | 1378 | 4.4% |
| 4 | 1297 | 4.1% |
| 2 | 1281 | 4.1% |
| 1 | 1220 | 3.9% |
| 5 | 1205 | 3.8% |
| 6 | 1039 | 3.3% |
| 7 | 946 | 3.0% |
| Other values (2) | 1810 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31531 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| $ | 10302 | |
| 0 | 8433 | |
| , | 2620 | 8.3% |
| 3 | 1378 | 4.4% |
| 4 | 1297 | 4.1% |
| 2 | 1281 | 4.1% |
| 1 | 1220 | 3.9% |
| 5 | 1205 | 3.8% |
| 6 | 1039 | 3.3% |
| 7 | 946 | 3.0% |
| Other values (2) | 1810 | 5.7% |
CAR_AGE
Real number (ℝ)
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 639 |
| Missing (%) | 6.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.2981476 |
| Minimum | -3 |
|---|---|
| Maximum | 28 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 80.6 KiB |
Quantile statistics
| Minimum | -3 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 8 |
| Q3 | 12 |
| 95-th percentile | 18 |
| Maximum | 28 |
| Range | 31 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 5.7144502 |
|---|---|
| Coefficient of variation (CV) | 0.68864167 |
| Kurtosis | -0.76432996 |
| Mean | 8.2981476 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.28046053 |
| Sum | 80185 |
| Variance | 32.654941 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2489 | |
| 8 | 696 | 6.8% |
| 9 | 659 | 6.4% |
| 7 | 655 | 6.4% |
| 10 | 600 | 5.8% |
| 6 | 552 | 5.4% |
| 11 | 549 | 5.3% |
| 12 | 467 | 4.5% |
| 13 | 450 | 4.4% |
| 14 | 396 | 3.8% |
| Other values (20) | 2150 | |
| (Missing) | 639 | 6.2% |
| Value | Count | Frequency (%) |
| -3 | 1 | < 0.1% |
| 0 | 4 | < 0.1% |
| 1 | 2489 | |
| 2 | 18 | 0.2% |
| 3 | 70 | 0.7% |
| 4 | 169 | 1.6% |
| 5 | 360 | 3.5% |
| 6 | 552 | 5.4% |
| 7 | 655 | 6.4% |
| 8 | 696 | 6.8% |
| Value | Count | Frequency (%) |
| 28 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 26 | 3 | < 0.1% |
| 25 | 8 | 0.1% |
| 24 | 13 | 0.1% |
| 23 | 22 | 0.2% |
| 22 | 33 | 0.3% |
| 21 | 65 | |
| 20 | 112 | |
| 19 | 155 |
CLAIM_FLAG
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 80.6 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 10302 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 1 | 2746 | 26.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 1 | 2746 | 26.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 1 | 2746 | 26.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10302 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 1 | 2746 | 26.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10302 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 1 | 2746 | 26.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10302 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7556 | |
| 1 | 2746 | 26.7% |
| ID | AGE | HOMEKIDS | YOJ | TRAVTIME | TIF | CLM_FREQ | MVR_PTS | CAR_AGE | KIDSDRIV | PARENT1 | MSTATUS | GENDER | EDUCATION | OCCUPATION | CAR_USE | CAR_TYPE | RED_CAR | REVOKED | CLAIM_FLAG | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ID | 1.000 | -0.013 | 0.009 | -0.005 | -0.001 | -0.012 | 0.001 | 0.010 | -0.003 | 0.018 | 0.010 | 0.014 | 0.004 | 0.021 | 0.012 | 0.000 | 0.000 | 0.022 | 0.000 | 0.000 |
| AGE | -0.013 | 1.000 | -0.515 | 0.149 | -0.001 | -0.002 | -0.047 | -0.062 | 0.185 | 0.166 | 0.326 | 0.090 | 0.076 | 0.126 | 0.108 | 0.063 | 0.095 | 0.071 | 0.045 | 0.159 |
| HOMEKIDS | 0.009 | -0.515 | 1.000 | 0.137 | -0.003 | 0.003 | 0.055 | 0.055 | -0.167 | 0.314 | 0.528 | 0.043 | 0.128 | 0.106 | 0.108 | 0.000 | 0.053 | 0.076 | 0.044 | 0.134 |
| YOJ | -0.005 | 0.149 | 0.137 | 1.000 | -0.010 | 0.013 | -0.018 | -0.032 | 0.040 | 0.077 | 0.061 | 0.247 | 0.115 | 0.066 | 0.241 | 0.058 | 0.070 | 0.077 | 0.000 | 0.083 |
| TRAVTIME | -0.001 | -0.001 | -0.003 | -0.010 | 1.000 | -0.009 | 0.012 | 0.010 | -0.033 | 0.024 | 0.029 | 0.014 | 0.000 | 0.029 | 0.041 | 0.000 | 0.000 | 0.000 | 0.000 | 0.062 |
| TIF | -0.012 | -0.002 | 0.003 | 0.013 | -0.009 | 1.000 | -0.021 | -0.035 | 0.001 | 0.000 | 0.019 | 0.000 | 0.015 | 0.000 | 0.006 | 0.000 | 0.003 | 0.023 | 0.027 | 0.085 |
| CLM_FREQ | 0.001 | -0.047 | 0.055 | -0.018 | 0.012 | -0.021 | 1.000 | 0.418 | -0.024 | 0.022 | 0.065 | 0.070 | 0.013 | 0.028 | 0.031 | 0.080 | 0.039 | 0.025 | 0.075 | 0.249 |
| MVR_PTS | 0.010 | -0.062 | 0.055 | -0.032 | 0.010 | -0.035 | 0.418 | 1.000 | -0.020 | 0.030 | 0.072 | 0.046 | 0.000 | 0.026 | 0.028 | 0.066 | 0.027 | 0.011 | 0.056 | 0.223 |
| CAR_AGE | -0.003 | 0.185 | -0.167 | 0.040 | -0.033 | 0.001 | -0.024 | -0.020 | 1.000 | 0.025 | 0.068 | 0.033 | 0.030 | 0.425 | 0.235 | 0.092 | 0.057 | 0.025 | 0.030 | 0.110 |
| KIDSDRIV | 0.018 | 0.166 | 0.314 | 0.077 | 0.024 | 0.000 | 0.022 | 0.030 | 0.025 | 1.000 | 0.231 | 0.039 | 0.047 | 0.038 | 0.043 | 0.000 | 0.020 | 0.045 | 0.035 | 0.112 |
| PARENT1 | 0.010 | 0.326 | 0.528 | 0.061 | 0.029 | 0.019 | 0.065 | 0.072 | 0.068 | 0.231 | 1.000 | 0.474 | 0.068 | 0.090 | 0.093 | 0.000 | 0.055 | 0.043 | 0.049 | 0.158 |
| MSTATUS | 0.014 | 0.090 | 0.043 | 0.247 | 0.014 | 0.000 | 0.070 | 0.046 | 0.033 | 0.039 | 0.474 | 1.000 | 0.000 | 0.046 | 0.030 | 0.006 | 0.000 | 0.009 | 0.039 | 0.129 |
| GENDER | 0.004 | 0.076 | 0.128 | 0.115 | 0.000 | 0.015 | 0.013 | 0.000 | 0.030 | 0.047 | 0.068 | 0.000 | 1.000 | 0.050 | 0.251 | 0.282 | 0.717 | 0.663 | 0.005 | 0.019 |
| EDUCATION | 0.021 | 0.126 | 0.106 | 0.066 | 0.029 | 0.000 | 0.028 | 0.026 | 0.425 | 0.038 | 0.090 | 0.046 | 0.050 | 1.000 | 0.560 | 0.217 | 0.094 | 0.030 | 0.017 | 0.151 |
| OCCUPATION | 0.012 | 0.108 | 0.108 | 0.241 | 0.041 | 0.006 | 0.031 | 0.028 | 0.235 | 0.043 | 0.093 | 0.030 | 0.251 | 0.560 | 1.000 | 0.573 | 0.136 | 0.173 | 0.026 | 0.188 |
| CAR_USE | 0.000 | 0.063 | 0.000 | 0.058 | 0.000 | 0.000 | 0.080 | 0.066 | 0.092 | 0.000 | 0.000 | 0.006 | 0.282 | 0.217 | 0.573 | 1.000 | 0.539 | 0.189 | 0.006 | 0.136 |
| CAR_TYPE | 0.000 | 0.095 | 0.053 | 0.070 | 0.000 | 0.003 | 0.039 | 0.027 | 0.057 | 0.020 | 0.055 | 0.000 | 0.717 | 0.094 | 0.136 | 0.539 | 1.000 | 0.486 | 0.033 | 0.134 |
| RED_CAR | 0.022 | 0.071 | 0.076 | 0.077 | 0.000 | 0.023 | 0.025 | 0.011 | 0.025 | 0.045 | 0.043 | 0.009 | 0.663 | 0.030 | 0.173 | 0.189 | 0.486 | 1.000 | 0.000 | 0.000 |
| REVOKED | 0.000 | 0.045 | 0.044 | 0.000 | 0.000 | 0.027 | 0.075 | 0.056 | 0.030 | 0.035 | 0.049 | 0.039 | 0.005 | 0.017 | 0.026 | 0.006 | 0.033 | 0.000 | 1.000 | 0.155 |
| CLAIM_FLAG | 0.000 | 0.159 | 0.134 | 0.083 | 0.062 | 0.085 | 0.249 | 0.223 | 0.110 | 0.112 | 0.158 | 0.129 | 0.019 | 0.151 | 0.188 | 0.136 | 0.134 | 0.000 | 0.155 | 1.000 |
| ID | KIDSDRIV | AGE | HOMEKIDS | YOJ | INCOME | PARENT1 | HOME_VAL | MSTATUS | GENDER | EDUCATION | OCCUPATION | TRAVTIME | CAR_USE | BLUEBOOK | TIF | CAR_TYPE | RED_CAR | OLDCLAIM | CLM_FREQ | REVOKED | MVR_PTS | CLM_AMT | CAR_AGE | CLAIM_FLAG | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 63581743 | 0 | 60.0 | 0 | 11.0 | $67,349 | No | $0 | z_No | M | PhD | Professional | 14 | Private | $14,230 | 11 | Minivan | yes | $4,461 | 2 | No | 3 | $0 | 18.0 | 0 |
| 1 | 132761049 | 0 | 43.0 | 0 | 11.0 | $91,449 | No | $257,252 | z_No | M | z_High School | z_Blue Collar | 22 | Commercial | $14,940 | 1 | Minivan | yes | $0 | 0 | No | 0 | $0 | 1.0 | 0 |
| 2 | 921317019 | 0 | 48.0 | 0 | 11.0 | $52,881 | No | $0 | z_No | M | Bachelors | Manager | 26 | Private | $21,970 | 1 | Van | yes | $0 | 0 | No | 2 | $0 | 10.0 | 0 |
| 3 | 727598473 | 0 | 35.0 | 1 | 10.0 | $16,039 | No | $124,191 | Yes | z_F | z_High School | Clerical | 5 | Private | $4,010 | 4 | z_SUV | no | $38,690 | 2 | No | 3 | $0 | 10.0 | 0 |
| 4 | 450221861 | 0 | 51.0 | 0 | 14.0 | NaN | No | $306,251 | Yes | M | <High School | z_Blue Collar | 32 | Private | $15,440 | 7 | Minivan | yes | $0 | 0 | No | 0 | $0 | 6.0 | 0 |
| 5 | 743146596 | 0 | 50.0 | 0 | NaN | $114,986 | No | $243,925 | Yes | z_F | PhD | Doctor | 36 | Private | $18,000 | 1 | z_SUV | no | $19,217 | 2 | Yes | 3 | $0 | 17.0 | 0 |
| 6 | 871024631 | 0 | 34.0 | 1 | 12.0 | $125,301 | Yes | $0 | z_No | z_F | Bachelors | z_Blue Collar | 46 | Commercial | $17,430 | 1 | Sports Car | no | $0 | 0 | No | 0 | $2,946 | 7.0 | 1 |
| 7 | 792300541 | 0 | 54.0 | 0 | NaN | $18,755 | No | NaN | Yes | z_F | <High School | z_Blue Collar | 33 | Private | $8,780 | 1 | z_SUV | no | $0 | 0 | No | 0 | $0 | 1.0 | 0 |
| 8 | 7945239 | 1 | 40.0 | 1 | 11.0 | $50,815 | Yes | $0 | z_No | M | z_High School | Manager | 21 | Private | $18,930 | 6 | Minivan | no | $3,295 | 1 | No | 2 | $6,477 | 1.0 | 1 |
| 9 | 3577610 | 0 | 44.0 | 2 | 12.0 | $43,486 | Yes | $0 | z_No | z_F | z_High School | z_Blue Collar | 30 | Commercial | $5,900 | 10 | z_SUV | no | $0 | 0 | No | 0 | $0 | 10.0 | 0 |
| ID | KIDSDRIV | AGE | HOMEKIDS | YOJ | INCOME | PARENT1 | HOME_VAL | MSTATUS | GENDER | EDUCATION | OCCUPATION | TRAVTIME | CAR_USE | BLUEBOOK | TIF | CAR_TYPE | RED_CAR | OLDCLAIM | CLM_FREQ | REVOKED | MVR_PTS | CLM_AMT | CAR_AGE | CLAIM_FLAG | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10292 | 452807843 | 0 | 48.0 | 0 | 10.0 | $111,305 | No | $0 | z_No | z_F | PhD | Doctor | 59 | Private | $17,430 | 13 | z_SUV | no | $0 | 0 | No | 4 | $0 | 18.0 | 0 |
| 10293 | 814422920 | 0 | 51.0 | 0 | 10.0 | $128,523 | No | $0 | z_No | M | Masters | NaN | 18 | Commercial | $32,960 | 6 | Panel Truck | no | $3,995 | 3 | No | 1 | $3,288 | 15.0 | 1 |
| 10294 | 721196389 | 1 | 38.0 | 4 | 16.0 | $12,717 | No | $0 | Yes | z_F | Bachelors | Student | 15 | Commercial | $24,740 | 1 | Pickup | no | $9,245 | 3 | No | 3 | $0 | 15.0 | 0 |
| 10295 | 215633551 | 0 | 41.0 | 0 | 7.0 | $6,256 | No | $0 | z_No | M | z_High School | Student | 41 | Private | $5,600 | 1 | Pickup | no | $0 | 0 | No | 0 | $0 | 7.0 | 0 |
| 10296 | 121441578 | 0 | 35.0 | 0 | 11.0 | $43,112 | No | $0 | z_No | M | z_High School | z_Blue Collar | 51 | Commercial | $27,330 | 10 | Panel Truck | yes | $0 | 0 | No | 0 | $0 | 8.0 | 0 |
| 10297 | 67790126 | 1 | 45.0 | 2 | 9.0 | $164,669 | No | $386,273 | Yes | M | PhD | Manager | 21 | Private | $13,270 | 15 | Minivan | no | $0 | 0 | No | 2 | $0 | 17.0 | 0 |
| 10298 | 61970712 | 0 | 46.0 | 0 | 9.0 | $107,204 | No | $332,591 | Yes | M | Masters | NaN | 36 | Commercial | $24,490 | 6 | Panel Truck | no | $0 | 0 | No | 0 | $0 | 1.0 | 0 |
| 10299 | 849208064 | 0 | 48.0 | 0 | 15.0 | $39,837 | No | $170,611 | Yes | z_F | <High School | z_Blue Collar | 12 | Private | $13,820 | 7 | z_SUV | no | $0 | 0 | No | 0 | $0 | 1.0 | 0 |
| 10300 | 627828331 | 0 | 50.0 | 0 | 7.0 | $43,445 | No | $149,248 | Yes | z_F | Bachelors | Home Maker | 36 | Private | $22,550 | 6 | Minivan | no | $0 | 0 | No | 0 | $0 | 11.0 | 0 |
| 10301 | 680381960 | 0 | 52.0 | 0 | 11.0 | $53,235 | No | $197,017 | Yes | z_F | z_High School | Clerical | 64 | Private | $19,400 | 6 | Minivan | no | $0 | 0 | No | 0 | $0 | 9.0 | 0 |
Most frequently occurring
| ID | KIDSDRIV | AGE | HOMEKIDS | YOJ | INCOME | PARENT1 | HOME_VAL | MSTATUS | GENDER | EDUCATION | OCCUPATION | TRAVTIME | CAR_USE | BLUEBOOK | TIF | CAR_TYPE | RED_CAR | OLDCLAIM | CLM_FREQ | REVOKED | MVR_PTS | CLM_AMT | CAR_AGE | CLAIM_FLAG | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 279799481 | 0 | 39.0 | 0 | 14.0 | $93,077 | No | $244,764 | Yes | M | Bachelors | Professional | 29 | Private | $14,710 | 1 | Minivan | yes | $0 | 0 | No | 0 | $0 | 1.0 | 0 | 2 |